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Abstract. The virtual observatory (VO) is a collection of interoperable data 
archives, tools and applications that together form an environment in which 
original astronomical research can be carried out. The VO is opening up new 
ways of exploiting the huge amount of data provided by the ever-growing number 
of ground-based and space facilities, as well as by computer simulations. This 
presentation summarises a variety of scientific results spanning various fields of 
astronomy, obtained thanks to the VO, after highlighting its structure, infras- 
tructure and various capabilities. 



1. Introduction 

A Virtual Observatory (VO) is a collection of inter-operating data archives and 
software tools which utilize the Internet to form an environment in which original 
research can be conducted. The VO is an international astronomical community- 
based initiative, whose main goal is to allow transparent and distributed access 
to data available worldwide. This is achieved by developing and applying com- 
mon standards and by ensuring the interoperability between the various data 
collections, tools and services. This allows scientists to discover, access, analyze, 
and combine observational and model data from heterogeneous data collections 
in a coherent and user- friendly manner. 



2. The VO concept in modern Astronomy 

The Virtual Observatory is a concept that arose from the need of modern-day 
astronomy to exploit the wealth of multi-instrument multi-wavelength datasets 
already available or to come very soon, in a quest to answer primordial questions 
such as that of the origin of the Universe, the formation and evolution of galaxies, 
the formation of stars and planets. 

Even though single observations provide instantanes of smaller or larger 
parts of the Universe at a given wavelength regime and/or at a corresponding 
look-back time, a more in depth understanding of the Universe, its history and 
future requires sampling the whole electromagnetic spectrum and studying the 
different epochs. And this need generates the problem of accessing increasingly 
large amounts of data in combination of the huge volumes accumulated over the 
years and their subsequent exploitation. 

Modern observational astronomy is dominated by large multi-wavelength 
surveys producing TBs of data every night. As an example of exploding data 
rates, Fig. Q] shows the past and projected data volume in the ESO archive (in 
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Figure 1. Past, present and projected future data volume in the ESO 
archive as a function of time (courtesy N. Fourniol; see also Eglitis & Suchar 
2009). 



TBs) as a function of time. These new data as well as the older data are stored 
in different archives all over the world, in different ways and formats, can be 
queried from different archive interfaces using different access methods and can 
be analysed with different tools and techniques. 

In this framework, the VO is simply an organised effort to provide the sci- 
ence users a centralised and uniform access to all these data without having them 
worrying about different formats and access mechanisms, while simultaneously 
providing all the necessary information, in the form of metadata, for a scientific 
usage of these data collections. 

The VO, however, is not a centralised database of all the astronomical data. 
It does not store the data - it merely provides access to them. And it does not 
play the role of data police. The quality of the stored data is a responsibility 
of the data providers and it is up to them to make sure that their quality and 
accompanying metadata are sufficient to ensure high-quality scientific research 
based on their usage. Last but not least, the VO is not a monolithic software 
system and there is no such thing as a "killer application". There is not (and 
there will never be) a unique tool dealing with all the aspects of imaging, spectral 
and tabulated data discovery, retrieval and subsequent analysis. Instead, it 
provides a suite of multi-purpose tools and services and it is up to the users to 
select those that suit them best, based on their needs. 

In its current status, the VO is in a transitional phase, going from devel- 
opment to operations. The amount of reduced data offered via VO services 
increases by the day, the tools, applications and services become more and more 
mature and VO-enabled science is now taking place in parallel with development, 
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indicating the most urgent needs in term of data availability, functionalities and 
standards. 

3. The various VO entities 

3.1. The International Virtual Observatory Alliance (IVOA) 

In order to insure the world-wide interoperability of applications and the devel- 
opment and adoption of common standards, the International Virtual Obser- 
vatory Alliance (IVOA) was formed in June 2002. The IVOA now comprises 
17 VO projects from Armenia, Australia, Brazil, Canada, China, the European 
VO (EURO-VO), France, Germany, Hungary, India, Italy, Japan, Russia, South 
Korea, Spain, the United Kingdom, and the United States. 

The IVOA is structured in Working and Interest Groups, each one respon- 
sible of a given area where key interoperability standards have to be defined and 
agreed upon. An IVOA standard could be described as a set of guidelines and 
rules for data providers or software developers who want their products (data 
or tools) to be VO compliant and interoperable. It describes the information 
necessary for a service or dataset to be VO compliant, both in terms of manda- 
tory metadata and implementation. IVOA standards are only relevant to data 
providers and software developers and are meant to be completely transparent 
to the scientific user. 

3.2. The European Virtual Observatory (EURO-VO) 

The European Virtual Observatory (EURO-VO) project aims at deploying an 
operational VO in Europe. The EURO-VO is active in the areas of IVOA 
standards definition, tools development, as well as dissemination for scientific 
usage. Its internal structure consists of three bodies, namely the Data Centre 
Alliance, the Technology Centre and the Facility Centre. 

• the Data Centre Alliance, an alliance of European data centres, that pro- 
vides the physical storage and publishes data, metadata and services to 
the EURO-VO using VO technologies; 

• the Technology Centre is a distributed organisation that coordinates a set 
of technological projects related to standards, systems and tools; 

• the Facility Centre, is the interface between the EURO-VO and the astro- 
nomical community. Its immediate aims are to introduce the VO capa- 
bilities to the European astronomers by means of workshops, Schools and 
tutorials and to support research carried out using the VO infrastructure. 

3.3. Other national VO initiatives 

Around the world, various national VO initiative are carrying out activities in 
support of data publishing and scientific usage of the VO. The US-VOQ for 
instance, offers a centralised portal where users can look for data in the VO as 



1 http:/ /www. us- vo.org/ 



4 



E. Hatziminaoglou 



well as exploit them with VO tools. The German VO (GAVO)B offers, among 
other things, access to theoretical data (e.g. Millenium simulation) through VO 
protocols. 



4. VO tools and services 

There is already a large variety of available VO-compatible applications for their 
immediate use to do science. Their level of maturity depends on a high degree 
on the level of maturity of the corresponding IVOA protocols and standards. As 
a consequence of the flexibility of the standards, as well as for historical reasons 
(i.e. many VO tools have evolved from existing pre-VO era applications) several 
of the applications might overlap in functionality. 

Broadly speaking, VO tools can be classified in two main categories, namely 
data discovery and data analysis, with most tools, however, suitable for both 
categories. A major advance towards interoperability, was the development of 
SAMP, the Simple Application Messaging Protocol, that allows applications 
to communicate with each other, building thus a complete VO environment 
put together by various smaller pieces. For an up-to-date list of VO tools, 
applications and services listed either alphabetically or by functionality, see the 
section Software in the EURO-VO pageqj. 

4.1. Data discovery 

There is a variety of VO tools and services allowing for imaging, spectra and 
catalogue discovery, such as Aladin0 developed by the CDS in France, Datas- 
cop^l provided by the US-VO, or VODesktopEI developed by AstroGrid, to name 
a few. They allow the user to look for all the reduced data available in all the 
VO-compliant archives and data repositories (or registries) for a source, browse 
through them, select those of interest and visualise in various forms. Most of the 
tools, in their current state and in interactive (i.e. non scripting) mode allow 
either for single-object multi-resource query or for a multi-object single-resource 
query. 

Fig. [2] shows the VODesktop interface. In the top left the list of resources 
grouped by user-defined criteria are shown. Each list consists of a variety of 
resources, shown in the right part of the window. For a selected resource, a 
detailed description is available at the bottom. The services provided by each 
resource are listed at the right of the resource name (e.g. cone search is marked 
by a green cone, a globe indicates a link to the web interface etc). The actions 
allowed for each selected resource are listed at the bottom left of the window. 



2 http:/ /www. g-vo.org/ 

3 http:/ / www.euro-vo.org/ software.html 

4 http: / / aladin.u-strasbg.fr/aladin.gml 

5 http: / /heasar cgsfc.nasa.gov / cgi-bin /vo/ datascope / init .pi 
6 http://www. astrogrid.org/wiki/Home/AboutAGDesktop 
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Filtering the resources is possible in a variety of ways, e.g. by service type, data 
Publisher, column and many more. 
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Figure 2. VODesktop main interface. For details on the various sections 
see text. 



Already existing t ools are being adapted to exploit t he VO world. An inter- 
esting case is VirGOQ (lHatziminaoglou fc Chereaufl2009l ) . the new ESO Archive 
Browser. Since its latest release (September 2009) VirGO comes with a collec- 
tion of pre-configured VO data resources (from ESO and external data centers 
like HST, CADC, ESA etc). A new configuration panel gives the possibility to 
add new ones in a quick and straight-forward way. And like every other VO 
tool, VirGO now "speaks" SAMP, allowing for communication with other VO 
applications. 



4.2. Data analysis 

Imaging data analysis The most advanced imaging analysis VO tool to date 
is Aladin. It is an interactive sky atlas that allows the user to search, browse 
through and visualise images retrieved from VO resources, perform astrometric 
and photometric calibration on them, superpose catalogues and instruments 
fields of view, create colour-composite images and many more. It also includes 
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a scripting capability for multi-object (by target name or coordinates) multi- 
resource search. Fig. [3] shows an Aladin main window, showing in a multi-view 
mode from right to left and top to bottom the following: NGC4826 Simbad 
objects in the field superposed; NGC6946 with all NED objects in the field; 
NGC1068 with a modified colour map; and M81 with the WFPC2 footprint on 
it. 
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Figure 3. Aladin main window, showing in a multi-view mode from right 
to left and top to bottom the following: NGC4826 with all Simbad objects in 
the field superposed; NGC6946 with all NED objects in the field; NGC1068 
with a modified colour map; and M81 with the WFPC2 footprint on it. 



Spectral analysis Today the VO has three spectral analysis tools to offer, all 
three of them very similar in functionalities, namely (in alphabetical order) 
SpecvieM0 provided by the US-VO, SPLAT- VO0, provided by AstroGrid, and 
VOSpecB, developed by ESA-VO. 



8 http: / / www.stsci.edu / resources / software -hardware / specview / 

9 http: / / star-www.dur.ac.uk / ^pdraper / splat / splat- vo / 

°http:/ /www.sciops.esa.int/index.php?project=ESAVO&page=vospec 
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All three tools allow the user to directly query all the available registries that 
contain ground-based and space spectral data and/or load local files. Other than 
the display capabilities, the tools provide various high level analysis modules 
such as line measurement and fitting, line identification etc. Fig. [4| shows a 
spectral energy distribution (SED) for NGC1068 built using FUSE, HST and 
ISO spectra, constructed with VOSpec. 
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Figure 4. VOSpec screenshot showing the SED of NGC1068 built using 
FUSE, HST and ISO spectra queried directly from relevant VO resources. 



Handling of tabular data A nice example of a tabular data handling tool is 
TOPCAlEj distributed as part of the AstroGrid tool suite. TOPCAT offers a 
wide variety of data visualisation functionalities, such as histogrammes, scatter 
plots, 3D and density plots; table and column metadata visualisation and editing 
(e.g. operation among columns, coordinate transformations); table format con- 
versions (supported formats include ascii, fits, VOTable, tsv and csv [tap- and 
coma-separated values tables, respectively], latex and more); cross-correlation 
capabilities of up to four tables simultaneously using various cross-correlation 



11 http: / / www.star.bris.ac.uk / ~mbt / topcat / 
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algorithms and many more. It also uses VO standards in order to query the 
VO-compliant resources and therefore allows the user to combine data residing 
on a local disk with data available through the VO. Thanks to the implemen- 
tation of SAMP, TOPCAT can also communicate with all other VO tools and 
applications. Fig. [5] shows a collection of screenshots of the above-mentioned 
capabilities. 




Figure 5. From left to right and top to bottom, a 3D plot (with RA, Dec 
and redshift as axes 5 a redshift histogram; the table column editor and a 
scatter plot (RA, Dec) using a third column (redshift) for colour coding. A 
subsample of the SDSS DR6 galaxy sample has been used for the creation of 
the plots. 

For tables larger than a few tens of thousands of rows, STILTS, the STIL 
Tool Set, a command-line tool is recommended. STILTS performs pretty much 
the same operations TOPCAT does but in a non-interactive, scriptable way. It 
is also recommended in case of operations having to be repeated several times. 

VOPlotB is an alternative to TOPCAT, developed by VO-India, capable of 
operating on tables in VOTable format. Aladin has also some table querying and 
handling capabilities. It allows to query the VO registry for all available tabular 
datasets based on coordinates or target names, and allows for cross-correlations. 
Cross-correlations of up to three tables simultaneously and up to 5000 objects at 
a time can also be performed using Open SkyQuer£3 developed by the US-VO. 



ht t p : / / www . st ar . br is . ac . uk/ ~ mbt /st ilt s/ 
http : / / vo .iucaa.ernet .in / ~ voi / voplot .htm 

http: / / www.openskyquery.net / Sky / sky site / OSQform / default .aspx 
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5. Recent scientific results 

The number of VO-enabled publications is now rising and the research carried 
out is covering all aspects on modern astronomy, from solar and stellar physics 
to extragalactic astronomy and cosmology. The main goal of the VO has been 
to allow for a seamless data access across archives; and even though VO tools 
and infrastructure are mostly used as a platform for data mining, other types of 
usage such as using VO tools for combined data visualisation, has been seen in 
the recent literature. 

Ever since the first, full y VO-enabled discov ery of 31 type 2 QSO candi- 
dates in the GOODS fields (|Padovani et al.1 [20041 ), many new discoveries were 
made with the help of VO applications and s ervices. An exploitation of the 
IPHAS catalogue (jGonzales-Solares et al.l 12008) in combination with 2MASS in 



an area of 1600 square degrees recently revealed 23 accreting young;, low-mass 



stars and brown dwarfs far from known star-forming regions (jValdivielso et al 



2009), suggesting the existence of hundreds of such objects in the area cov- 



ered by IPHAS. Similarly, a high proper motion L sub-dwarf was recently 
identified in the SP SS catalogue comb ining SDSS and 2MASS spectrophoto- 
metric information (ISivarani et al.ll2009l ). Taking advantage of the wide field 
of view of HRI / ROSA T and the availability of public data through VO tools, 
Caballero et al.l ( 20091 ) investigated the daily X-ray variability of 23 stars in the 



young a Orionis cluster. They detected unusual flares with durations of about six 
days, duration considerably larger than the ~24 hours usually observed in active 
stars residing in very young star forming regions. This last work is actually part 
of an ongoing investigation of the a Orionis region that also revealed large num- 
bers of previously unknown br own dwarf cand i dates by pasting together various 
large photometric catalogues riCaballerd 120081 : ICaballero fc Solanoll2008h . 



A major contribution of the VO to astronomy is the possibility it offers 
for retrieving, exploiting and visualising large datasets. These capab il ities have 
been extensively used lately to produce new results. In iFathi et al.1 f|2009h for 
instance, a sample of ^55000 Sa-Sd SDSS galaxies was selected, cross-correlated 
with HyperLeds0 in order to extract the morphological information and then 
g, r, z, z, J, H and if band images for each one of them were retrieved from SDSS 
and 2MASS using various VO services, with the aim of studying the scale length 
of disk galaxies as a function of redshift and morphology. Large catalogues of 
legacy value constructed with the help of the VO infrastructure have also been 
publishe d recently. One ex ample is the Large Quasar Astrometric Catalogue 
(LQAC: ISouchay et alJl2009h that contains 113666 quasars with photometry in 



u, 6, v, g, r, z, z, J, if as well as redshift and radio fluxes at 1.4, 2.3, 5.0, 8.4 and 
24 GHz, when available. This catalogue is the results of the combination of the 
12 largest quasar catalogues available in the literature. 

The volume of datasets available through VO protocols is increasing ex- 
ponentially, thanks to many large projects and surveys providing their data 
through VO-compliant interfaces but also thanks to the smaller projects that 
make the extra effort of adding a "VO-layer" to their data and publishing them 
using VO protocols. Recently, the Hubble Legacy Archive (HLA) made available 



3 http:/ /leda.univ- lyonl.fr/ 
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through VO interfaces all HLA NICMOS G141 grism spectra (jFreudling et al.1 
2008), enabling thus flexible querying of the 2470 available spectra. An example 



of feedback from the community to the VO was the effort off putting together 
a catalogue of quasar candida tes selected from t heir p hotometry extracted from 
the SDSS DR7, presented in iD'Abrusco et all f|2009h . The two methods used 
for the quasar candidates selection, namely the probabilistic principal surfaces 
and the negative entropy clustering, were subsequently implemented into the 
AstroGrid VO platform. 

Smaller and larger projects are also using the VO tools for their visualisa- 
tion capabilities. Such an example is Exo-Dat, the information system created to 
support CoRoT ( Deleuil et al.l [20091 ). Exo-Dat has implemented various IVOA 
protocols and output formats and allow for a direct interfacing with VO tools 
such as Aladin and TOPCAT, avoiding thus having to develop its own visual- 
isation tools that would overlap in functionality with that of the existing VO 
applications. 



6. Conclusions 

The scientific usage of the VO tools and infrastructure is now taking up, and 
this is reflected by the increasing number of refereed and other publications 
making use of the VO capabilities. The EURO-VO and other national VO 
initiative are here to help interested groups to carry out VO-enabled research 
and provide support for projects that want to publish their data in the VO. 
Dedicated initiatives such a workshops and Schools are organised world-wide 
in order to make astronomers aware of the grand potential of the VO and the 
scientific possibilities it offers. 
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